Phylogenetic logistic regression for binary dependent variables.

Authors

  • Anthony R Ives
  • Theodore Garland
Abstract

We develop statistical methods for phylogenetic logistic regression in which the dependent variable is binary (0 or 1) and values are nonindependent among species, with phylogenetically related species tending to have the same value of the dependent variable. The methods are based on an evolutionary model of binary traits in which trait values switch between 0 and 1 as species evolve up a phylogenetic tree. The more frequently the trait values switch (i.e., the higher the rate of evolution), the more rapidly correlations between trait values for phylogenetically related species break down. Therefore, the statistical methods also give a way to estimate the phylogenetic signal of binary traits. More generally, the methods can be applied with continuous- and/or discrete-valued independent variables. Using simulations, we assess the statistical properties of the methods, including bias in the estimates of the logistic regression coefficients and the parameter that estimates the strength of phylogenetic signal in the dependent variable. These analyses show that, as with the case for continuous-valued dependent variables, phylogenetic logistic regression should be used rather than standard logistic regression when there is the possibility of phylogenetic correlations among species. Standard logistic regression does not properly account for the loss of information caused by resemblance of relatives and as a result is likely to give inflated type I error rates, incorrectly identifying regression parameters as statistically significantly different from zero when they are not.
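The link between switching rate and phylogenetic signal can be illustrated with a minimal sketch. It assumes a symmetric two-state continuous-time Markov chain (equal 0→1 and 1→0 rates) started from its stationary distribution at the root; this is a simplification chosen for illustration and is not necessarily the parameterization used in the paper. The function names `same_state_probability` and `tip_correlation` are hypothetical.

```python
import math


def same_state_probability(rate: float, elapsed: float) -> float:
    """Probability that a lineage has the same trait value after `elapsed` time.

    Assumes a symmetric two-state chain in which 0->1 and 1->0 both occur
    at `rate` (an illustrative assumption, not the paper's exact model).
    """
    return 0.5 + 0.5 * math.exp(-2.0 * rate * elapsed)


def tip_correlation(rate: float, time_since_split: float) -> float:
    """Correlation between the trait values of two tip species.

    The two species diverged `time_since_split` ago, so the path joining
    them through their common ancestor has length 2 * time_since_split.
    Assumes the ancestor's state is drawn from the stationary distribution
    (probability 1/2 for each state).
    """
    path_length = 2.0 * time_since_split
    return math.exp(-2.0 * rate * path_length)


if __name__ == "__main__":
    # Faster switching erodes the correlation between related species.
    for rate in (0.1, 1.0, 5.0):
        print(f"rate={rate}: correlation={tip_correlation(rate, 1.0):.4f}")
```

Under these assumptions the correlation decays exponentially in the product of switching rate and divergence time, which is the sense in which a higher rate of evolution corresponds to weaker phylogenetic signal.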

Similar resources

Phylogenetic Regression for Binary Dependent Variables

We compare three methods for phylogenetic regression analyses designed for binary dependent variables (traits with two discrete states) both with each other and with ‘‘standard’’ methods that either ignore phylogenetic relationships or ignore the binary character of the dependent variable. In simulations designed to reveal statistical problems arising in different methods, PLogReg (Ives and Gar...

The Practical Value of Logistic Regression

Logistic multiple regression using the method of maximum likelihood is now the method of choice for many regression-type problems involving binary, ordinal, or nominal dependent variables. Logistic regression does not require grouping of observations to obtain valid estimates of effects and of outcome probabilities, and it has been shown in the binary case to provide more accurate probability e...

Application of Ordinal Logistic Regression Models in Quality of Life Studies

 Background & Objectives: Due to the increasing tendency to measure the quality of life in recent years and the extensive quality of life questionnaires, it is important to determine the appropriate method of analyzing data derived from these studies. The aim of the present study was to introduce ordinal logistic regression models as an appropriate method for analyzing the data of quality of li...

A New Nonlinear Specification of Structural Breaks for Money Demand in Iran

In a structural time series regression model, binary variables have been used to quantify qualitative or categorical events such as political and economic structural breaks, regions, age groups, etc. The use of binary dummy variables is not reasonable because the effect of an event decreases (or increases) gradually over time rather than all at once. The simple and basic idea in this paper i...

Statistical notes for clinical researchers: logistic regression

https://rde.ac Logistic regression is a regression model where the dependent variable is categorical and corresponding independent variables can be categorical or continuous. This article covers the case of a binary dependent variable such as an event occurring coded 1 = ‘event’ and 0 = ‘no event’. Frequent outcomes are pass/fail, win/lose, disease/no disease, etc. The logistic regression model...

Journal:
  • Systematic Biology

Volume 59, Issue 1

Pages  -

Publication date: 2010